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1  Introduction 

Pattern-directed  invocation  and  equality  reasoning  are  two  techniques  com¬ 
monly  used  in  automated  reasoning  systems.  This  paper  describes  a  problem 
that  arises  when  these  two  techniques  are  combined,  and  gives  a  solution. 

1.1  A  Simple  Example  of  the  Problem 

Suppose  we  are  reasoning  about  a  function,  /,  which  obeys  the  axiom 

V.x  /( 0,x)  >  0. 

A  typical  implementation  of  this  axiom  is  a  demon  with  the  pattern  /( 0,?x) 
and  a  body  that  asserts  /( 0,?x)  >  0.1  This  demon  can  be  thought  of  as 
instantiating  the  universally  quantified  axiom  above  for  appropriate  terms  in 
the  reasoning  data  base.  For  example,  when  the  term  /(0,c)  is  added  to  the 
data  base,  this  demon  is  invoked  and  asserts  /(0,c)  >  0. 

In  the  presence  of  equality,  however,  the  proper  conditions  for  invoking 
a  demon  become  more  complicated.  Suppose,  for  example,  that  the  term 
/(a,  b)  is  added  to  the  reasoning  data  base  and  the  equality  a  =  0  holds. 
In  the  usual  algorithm  for  pattern-directed  invocation,  nothing  will  happen, 
since  there  is  no  term  in  the  data  base  that  exactly  matches  the  pattern  of  the 
demon.  This  is  undesirable,  however,  because  we  would  like  the  reasoning 
system  in  this  situation  to  deduce  /(a,  b)  >  0,  which  follows  logically  from 
the  axiom  above.  We  describe  this  problem  as  a  lack  of  completeness  in  the 
pattern-directed  invocation. 

Note,  however,  that  if  the  term  /(0,6)  is  somehow  created  in  this  situa¬ 
tion,  the  desired  deduction  will  be  made.  The  term  /(0,i>)  exactly  matches 
the  pattern  of  the  demon,  which  then  asserts  /(0,  b)  >  0.  Since  f(0,b)  is 
equal  to  /(a,  6)  by  substitution  of  equals  (0  for  a),  equality  reasoning  further 
deduces  that  f(a,b)  >  0. 

Thus,  a  brute-force  approach  to  the  lack  of  completeness  in  standard 
pattern-directed  invocation  is  to  close  the  data  base  under  substitution  of 
equals.  This  approach  is  not  feasible,  however,  because  the  numbers  of  terms 
generated  grows  exponentially  with  the  number  of  equalities,  and  is  infinite 
in  the  case  of  recursively  defined  equalities. 

1  Variables  are  denoted,  as  usual,  by  the  prefix 


2  Pat  tern- Directed  Invocation 

The  algorithm  developed  in  this  paper  solves  the  completeness  problem 
by  generating  a  subset  of  all  possible  substitutions,  based  on  an  analysis  of 
the  patterns  of  existing  demons.  Furthermore,  the  algorithm  is  incremental. 
New  demons,  new  terms,  and  new  equalities  can  be  added  in  any  order,  and 
completeness  will  be  maintained.  Equalities  can  also  be  retracted. 

1.2  Outline  of  the  Paper 

Section  2  provides  additional  background  of  the  completeness  problem  and 
sets  the  stage  for  the  rest  of  the  paper,  including  defining  the  terminology 
that  will  be  used.  In  addition  to  pattern-directed  invocation  and  equal¬ 
ity  reasoning,  Section  2  introduces  a  third  component,  truth  maintenance, 
into  the  environment  in  which  the  algorithm  operates.  Among  other  things, 
this  means  that  completeness  needs  to  be  maintained  when  equalities  are 
retracted. 

Section  3  formally  defines  completeness  for  a  given  data  base  of  terms 
and  set  of  demons.  Examples  are  given  of  demons  with  simple  and  more 
complex  patterns  to  illustrate  the  subtlety  of  the  issues  in  the  most  general 
case.  Finally,  Section  3  defines  the  notion  of  a  transparent  demon  and  then 
proves  a  key  theorem  that  is  the  basis  of  the  algorithm  of  Section  4. 

Section  4  develops  an  algorithm  that  incrementally  maintains  complete¬ 
ness  given  new  demons,  new  terms,  and  changing  equalities.  The  devel¬ 
opment  starts  with  a  simple  generalization  of  the  standard  algorithm  for 
pattern-directed  invocation,  and  then  adds  special  processing  for  demons 
with  complex  patterns.  Section  4  also  discusses  termination  of  the  algorithm 
and  some  extensions  that  improve  its  performance. 

Section  5  concludes  with  a  discussion  of  alternate  approaches  and  related 
work. 


2  Background  and  Environment 

The  completeness  problem  described  briefly  above  first  came  to  our  atten¬ 
tion  when  we  began  to  use  Me  A  Hester’s  Reasoning  I'tility  Package  (UUP)  [5] 
in  our  research  related  to  reasoning  about  programs  [8j.  Among  other  facili¬ 
ties,  RUP  included  equality  reasoning,  a  primitive  pattern-directed  invocation 
mechanism  (demons  were  associated  with  operator  symbols),  and  a  truth 
maintenance  system.  We  used  these  facilities  to  build  up  a  library  of  demons 
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for  reasoning  about  various  algebraic  properties  of  operators.  Since  our  rea¬ 
soning  also  involved  asserting  and  retracting  equalities  between  operators, 
we  immediately  began  to  run  into  the  incompleteness  problem. 

In  our  successor  to  RUP,  called  BREAD  (for  Basic  REAsoning  Device),  we 
have  extended  RUP  to  support  a  full  pattern-matching  language  with  vari¬ 
ables,  and  have  implemented  the  complete  algorithm  for  pattern-directed 
invocation  described  in  this  paper.  BREAD  also  includes  further  evolved 
versions  of  RUP’s  truth  maintenance  and  equality  reasoning  components. 
BREAD  is  now  the  kernel  of  a  general-purpose  knowledge  representation 
and  reasoning  system,  called  FRAPPE  [2],  which  is  itself  the  foundation  of 
a  special-purpose  system  for  reasoning  about  programs,  called  CAKE  [7]. 

Pattern-directed  invocation,  equality  reasoning,  and  truth  maintenance 
are  common  components  of  many  reasoning  systems.  In  order  to  facilitate  the 
incorporation  of  our  algorithm  into  other  systems,  we  attempt  in  this  paper, 
as  much  as  possible,  to  abstract  away  from  details  that  are  idiosyncratic  to 
BREAD.  In  this  vein,  the  following  sections  summarize  the  essential  properties 
of  pattern-directed  invocation,  equality  reasoning,  and  truth  maintenance 
that  are  relevant  to  our  algorithm. 

2.1  Pattern-Directed  Invocation 

Pattern-directed  invocation  is  a  common  technique  used  in  reasoning  sys¬ 
tems,  wherein  a  given  procedure  is  to  be  applied  to  every  term  of  a  particu¬ 
lar  form  in  the  data  base.  The  procedure  is  associated  with  a  pattern.  The 
combination  is  typically  called  a  demon ;  the  procedure  is  called  the  body  of 
the  demon. 

Whenever  a  new  term  is  added  to  the  data  base,  it  is  matched  against 
the  pattern  of  each  demon.  If  the  pattern  matches,  the  body  of  the  demon  is 
applied  to  the  matching  term.  (It  is  often  convenient  to  also  supply  the  set 
of  bindings  to  the  pattern  variables  as  an  argument  to  the  body,  although 
this  can  be  computed  from  the  matching  term).  Similarly,  whenever  a  new 
demon  is  added  to  the  system,  its  pattern  is  matched  against  all  terms  in 
the  data  base,  and  the  body  of  the  demon  is  applied  to  each  matching  term. 
In  most  implementations  of  pattern-directed  invocation,  elaborate  indexing 
structures  are  used  to  make  these  matching  processes  efficient. 

In  general,  both  terms  and  patterns  can  be  hierarchical.  An  atomic  term 
is  a  single  symbol,  such  as  /  or  0.  Non-atomic  terms  are  built  up  by  applying 
operators  to  arguments,  recursively,  such  as  /(0,  6)  or  P(f(a,b),c). 
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4  Pattern-Directed  invocation 

Patterns  are  written  using  the  same  notation  as  terms,  except  that  vari¬ 
ables  may  appear  in  some  positions.  (Technically,  a  pattern  may  have  no 
variables.  This  degenerate  case  occurs  in  several  examples  in  Section  4.4.) 
Patterns  are  not,  however,  stored  in  the  term  data  base --they  are  only  as¬ 
sociated  with  demons. 

A  flat  pattern  is  a  list  of  terms  and  variables,  such  as  /(0,?.r)  or 
P(/(a,  6),  ?x).  A  flat  pattern  has  all  of  its  variables  at  “top  level”.  A  pat¬ 
tern  with  variables  at  other  than  top  level  is  called  a  nested  pattern.  For 
example,  P(f(?x,  ?y),  ?y)  is  a  nested  pattern.  The  pattern  /(?,r,?y)  is  called 
a  subpattern  of  ?y),  ?y). 

The  algorithms  in  this  paper  treat  all  the  positions  of  patterns  and  terms 
symmetrically.  Therefore,  there  is  nothing  to  prevent  the  use  of  patterns 
with  variables  in  the  operator  position,  such  as  ?/(().  b). 

The  notation  used  for  demons  in  this  paper  is  illustrated  below: 

(  /( 0,?x)  ,  A<.assert(<  >  0)  ). 

This  is  the  demon  introduced  in  the  simple  example  problem  in  Section  1.1. 
The  body  of  the  demon,  which  may  be  an  arbitrary  procedure,  is  applied  to 
a  matching  term  t.  Assert  is  the  primitive  that  makes  a  premise. 

The  demon  above  is  an  example  of  a  common  form  of  restricted  demon, 
in  which  the  body  is  simply  a  call  to  assert,  with  its  argument  constructed 
from  parts  of  the  input  term.  This  form  of  demon  is  usually  defined  using  a 
notation  that  makes  the  assert  implicit  and  use's  variables  to  define  the  body, 
as  illustrated  below: 

Rule  / ( 0,  ? x )  / (0,  ?.r )  >  0. 

With  this  form  of  demon,  the  bindings  returned  by  the  matching  process 
are  substituted  in  the  right-hand  side  of  the  rule  before  it  is  asserted.  In 
implementations  of  pattern-directed  invocation,  such  as  HKKAI),  which  allow 
arbitrary  procedures  in  demon  bodies,  this  rule  notation  can  be  compiled 
into  a  procedure. 

A  standard  algorithm  for  pattern-directed  invocation  is  given  in  Figure  1 . 
It  will  serve  as  the  starting  point  for  the  complete  algorithm  developed  in 
Section  4.  Note  that  in  Figure  1  and  elsewhere,  we  use  the  notation  n~b 
to  mean  “a  is  identical  to  b"  (i.e.,  they  are  spelled  the  same),  versus  o  —  b. 
which  means  “a  equals  />”  in  the  current  equality  relation. 
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1  function  match(p,  t,  B) 

|  Match  pattern  p  to  term  t  ivith  bindings  B.  ] 

2  case 

3  p  is  a  variable: 

4  if  p  is  bound  to  s  in  B 

s  then  if  s  =  t 

6  then  return  B  |  same  value  ]j 

7  else  fail  |  different  value  ] 

8  else  return  i?U  {p  <—  t}  [  new  binding  J 

9  p  is  a  pattern  p0(pi, ...  ,pn)- 

10  if  t  is  a  term  t0(t\, . . . ,  tn) 

n  then  for  i  in  {0,...,n}  do  B  <—  match(p,,  ti,  B) 

12  return  B  [  all  positions  match  ]] 

13  else  fail 

14  p  is  a  constant: 

is  if  p=t  then  return  B 

16  else  fail 

it  function  try(t,  p,d) 

[  Match  term  t  against  pattern  p  and  apply  body  d  if  successful. 
is  B  <—  match(p,  t,  0) 

19  if  match  succeeded 

20  then  apply  d  to  t  [and  B\ 

21  function  add-term(t) 

I  Add  new  term  t  to  data  base.  J 

22  for  each  demon  (  p  ,  d  ) 

23  try  (t,p,d) 

24  add  t  to  data  base 

25  function  add-deinon(p,  d) 

J  Add  demon  with  pattern  p  and  body  d.  J 

26  for  each  term  t  in  data  base 

27  try  (t,p,d) 

28  add  (  p  ,  d  )  to  demons 


Figure  1.  A  standard  pattern  directed  invocation  algorithm.  Note1  that  the  match 
procedure  returns  a  set  of  bindings  w  hen  it  succeeds.  Failure  to  match  at  any  level 
is  assumed  to  propagate  to  the  top  level. 
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2.2  Equality  Reasoning 

Reasoning  with  changing  equalities  involves  two  main  tasks.  The  iirst  task  is 
to  maintain  the  equivalence  classes  of  the  equality  relation  under  transitivity 
and  reflexivity.  These  classes  need  to  be  joined  or  split  when  equalities  are 
asserted  or  retracted. 

The  second  task  is  to  guarantee  that  all  the  variants  of  a  term  are  in 
the  same  equality  class.  A  variant  of  the  (non-atomic)  term  t  is  any  term 
derived  from  t  by  substitution  of  equals.  Variants  of  t.  are  therefore  equal 
to  t.  For  example,  f (g(a,  b),  c)  is  a  variant  of  /(//,  e),  given  the  equalities 
d  =  g(a,  b)  and  e  =  c.  The  task  of  keeping  variants  in  t  he  same  equality  class 
is  sometimes  called  the  congruence  closure  problem,  because  it  has  been 
shown  to  be  reducible  to  the  problem  of  computing  the  congruence  closure 
of  a  relation  on  a  graph  (for  a  formal  definition,  see  [6]). 

The  set  of  all  possible  variants  of  a  term  grows  exponentially  with  the 
number  of  applicable  equalities.  For  example,  given  a  =  b  and  b=  c,  there  are 
eight  possible  variants  of  f(a,b).  Moreover,  the  set  of  all  possible  variants  is 
potentially  infinite.  For  example,  there  are  infinitely  many  variants  of  /?(«), 
given  the  recursive  equality  a  =  h(a).2 3  It  is  therefore  obviously  not  feasible  to 
solve  the  completeness  problem  by  creating  all  possible  variants  of  all  terms 
in  the  data  base. 

Note  that  equality  reasoning  does  not,  by  itself,  add  new  terms  to  the 
reasoning  data  base.  All  that  equality  reasoning  does  is  update  the  equality 
classes  on  existing  terms  when  equalities  change,  and  place  newly  created 
terms  into  the  correct  equality  class.'* 

The  equality  reasoning  algorithm  used  in  BREAD  is  due  to  McAUestor  [4], 


2.3  Truth  Maintenance 

Nonmonotonic  reasoning  is  a  common  feature  of  many  kinds  problem  solving, 
such  backtracking  search  or  hypothetical  reasoning.  In  such  cases,  proposi¬ 
tions  in  the  reasoning  data  base  sometimes  need  to  be  retracted  along  with 

2Such  equalities  are  also  often  called  ‘'circular”,  because  they  correspond  to  cycles  in 
the  graph  of  t lie  equality  relation. 

3The  equality  reasoning  in  BREAD  actually  does  create  new  terms  for  efficiency  reasons 
related  to  retraction  of  equalities  However,  these  are  specially  managed  so  that  they  do 
not  interact  with  pattern-directed  invocation. 


With  Changing  Equalities 


7 


their  consequences.  The  collection  of  facilities  to  support  this  kind  of  rea¬ 
soning  has  come  to  be  called  a  truth  maintenance  system  (tms)  [1,  3]. 4 

The  most  basic  function  of  a  TMS  is  to  keep  track  of  which  propositions 
in  the  reasoning  data  base  are  supported  by  the  current  set  of  premises.  A 
proposition  is  a  boolean- valued  term.  For  example,  both  a+1  and  a  >  0  are 
terms,  but  only  a  >  0  is  a  proposition.  A  premise  is  a  proposition  that  is 
believed  without  support. 

Reasoning  procedures  interface  with  a  TMS  by  asserting  and  retract¬ 
ing  premises  and  installing  dependencies  between  propositions.  To  assert 
a  proposition  means  to  make  it  a  premise.  This  is  a  monotonic  change — it 
can  only  expand  the  set  of  supported  propositions.  To  retract  a  premise 
means  to  remove  it  from  the  current  set  of  premises.  This  is  a  nonmonotonic 
change — the  set  of  supported  propositions  can  shrink  as  a  result. 

The  main  task  of  the  TMS  is  to  incrementally  keep  track  of  which  proposi¬ 
tions  in  the  reasoning  data  base  are  supported,  as  premises  are  asserted  and 
retracted.  A  proposition  is  supported  if  and  only  if  all  of  the  propositions  it 
depends  on  are  either  premises,  or  are  supported.  Like  equality  reasoning,  a 
TMS  does  not,  by  itself,  create  or  destroy  terms. 

There  are  many  other  important  issues  in  the  implementation  and  use 
of  a  TMS  not  discussed  here,  because  they  are  not  relevant  to  the  algorithm 
developed  here. 

A  reasoning  system  that  is  constructed  using  a  TMS  must  obey  a  stringent 
discipline  of  installing  the  appropriate  dependencies  whenever  deductions  are 
made.  For  example,  the  following  section  describes  the  interface  between 
equality  reasoning  and  truth  maintenance. 


2.4  Equality  and  Truth  Maintenance 

In  this  paper,  we  assume  (as  is  the  case  in  BREAD)  that  equalities  are  propo¬ 
sitions  in  the  reasoning  data  base.  This  docs  not  contradict  the  claim  that 
equality  reasoning  per  se  creates  no  new  terms.  For  example,  if  some  rea¬ 
soning  process  is  interested  in  whether  a  is  equal  to  c,  given  the  premises 
o  -6  and  6  =  c,  it  would  typically  add  the  proposition  a  =  c  to  the  data  base. 

4This  is  somewhat  a  misnomer,  since  the  system  isn’t  really  concerned  so  much  with 
what  is  true,  but  rather  the  reasons  for  believing  things.  Some  attempts  have  been  made 
to  relabel  these  systems  as  “belief  maintenance”,  or  “reason  maintenance”,  but  none  of 
these  terms  has  stuck. 
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Equality  reasoning  then  installs  the  appropriate  dependencies  (transitivity), 
from  which  the  TMS  concludes  that  a  =  c  is  supported. 

A  similar  process  take?  place  with  reasoning  by  substitution  of  equals.  In 
the  example  of  Section  2.2,  the  equality  between  variants 

f(g{a,b),c)  =  f(d,e) 

depends  on  the  equalities  used  in  the  substitution: 

d  =  </(<!,&), 

e  =  c. 

Note,  however,  that  even  if  these  supporting  equalities  are  retracted,  it  is 
still  possible  for  f(g(a,b),c)  =  f(d,e)  to  be  a  premise  or  be  supported  for 
other  reasons,  i.e.,  the  two  terms  could  be  equal  but  not  variants.  This  will 
turn  out  to  be  an  important  consideration  in  the  development  of  algorithm 
for  pattern-directed  invocation. 

Another  part  of  the  interface  between  between  equality  reasoning  and 
the  TMS  concerns  the  special  case  of  equality  between  propositions  (boolean¬ 
valued  terms):  A  proposition  is  supported  if  it  is  equal  to  a  supported  propo¬ 
sition.  For  example,  the  proposition  Q  can  depend  on  the  proposition  R  and 
the  equality  Q  —  R.  Similarly,  the  proposition  P(a)  ran  depend  on  the  propo¬ 
sition  P(b)  and  the  equality  P(a)  =  P(b),  which  itself  depends  on  the  equality 
o  =  6. 


3  Completeness  and  Transparency 

We  oegin  this  section  with  a  formal  definition  of  completeness.  The  notion 
of  equivalent  variants  is  then  introduced  and  used  to  define  a  transparent 
demon  as  one  that  has  logically  equivalent  results  when  applied  to  equivalent 
variants.  The  section  culminates  in  a  theorem  establishing  an  implementable 
constraint  on  pattern-directed  invocation  that  guarantees  completeness  for 
transparent  demons.  The  algorithm  developed  in  Section  1  maintains  this 
condition  incrementally. 

3.1  Completness 

The  intuitive  notion  of  completeness  is  t  hat  we  want  to  create  enough  variants 
of  terms  in  the  data  base  to  make  use  of  all  the  knowledge  in  the  demons. 
We  can  lefine  this  more  formally  as  follows. 
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Definition.  A  data  base  of  terms  is  complete  with  respect  to  a 
given  set  of  demons  if  and  only  if  applying  demons  to  matching 
variants  of  terms  in  the  data  base  does  not  change  the  equality 
theory  of  the  data  base. 

The  equality  theory  of  a  data  base  is  the  set  of  propositions  that  follow  logi¬ 
cally  from  supported  propositions  in  the  data  base  by  the  axioms  of  equality 
(transitivity,  symmetry,  reflexivity,  and  substitution  of  equals).  Note  that 
this  defines  completeness  for  a  given  (fixed)  data  base;  the  purpose  of  the  al¬ 
gorithm  in  Section  4  is  to  guarantee  completeness  incrementally  when  demons 
and  terms  are  added  and  equalities  change. 


3.2  Equivalent  Variants 

The  definition  of  completeness  above  can  be  satisified  trivially  (at  least  in 
the  absence  of  recursively  defined  equalities)  by  applying  demons  to  all  pos¬ 
sible  variants  of  every  term  in  the  data  base.  In  this  section  we  define  an 
equivalence  relation  on  variants  that  helps  characterize  which  variants  are 
necessary  for  completeness,  and  which  are  not. 

Let  us  return  to  the  simple  example  demon  introduced  in  Section  1.1: 

Rule  /( 0,?x)  =>  /( 0,?x)  >  0. 

Suppose  the  term  f(a,b)  is  in  the  data  base,  and  a  =  0  and  6=1.  Given 
this  equality  relation,  the  set  of  possible  variants  of  the  term  /(a,  b)  is 

{/(a,l),/(0,6),/(0,l)}. 

The  first  variant,  /(a,  1),  doesn’t  match  the  pattern  of  the  demon,  so  it 
clearly  does  not  need  to  be  created.  The  second  and  third  variants  both 
match  the  pattern.  Only  one  of  these  variants  is  necessary,  however,  to 
conclude  that 

f(a.b)  >  0. 

If  the  demon  is  applied  to  the  variant  /(0,6),  it  will  assert 

/(0,  b)  >  0, 

from  which  f(a,  b)  >  0  follows  by  equality.  Alternatively,  if  the  demon  is 
applied  to  /( 0,1),  it  will  assert 
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from  which  f(a,b )  >  0  follows  by  equality.  (The  algorithm  in  Section  4 
chooses  /(0,  b)  for  reasons  which  will  be  explained  later.) 

In  the  case  of  flat  patterns,  only  one  variant  is  necessary  to  achieve  com¬ 
pleteness.  The  main  complexity  in  the  completeness  problem  arises  in  the 
case  of  nested  patterns.  To  illustrate,  consider  the  following  demon 

Rule  P(/i(?x))  =*  P(/i(?x))  Q(?x), 

which  implements  the  axiom5 

Vx  P(h(x))  —v  Q(x). 

Suppose  the  term  P{c)  is  in  the  data  base,  and 

c  -  h  ( a ) , 
c  =  h(b), 

but  it  is  not  the  case  that  a=b  (i.e.,  h(a)  and  h(b)  are  equal,  but  not  variants). 
The  set  of  matching  variants  of  P(c)  in  this  situation  is 

{P(h(u)),P(h(b))). 

In  this  case,  both  of  these  variants  are  needed  for  completeness.  If  only 
the  variant  P(h(a))  is  created,  then  the  demon  will  assert  P(h(a))  — ♦  Q{a). 
If  P(c)  is  true,  <3(a)  will  be  deduced,  but  Q(b)  will  not.  Similarly,  if  only  the 
variant  P(h(b))  is  created,  then  the  demon  will  assert  P(k(b))  —*  Q(b).  but 
Q{a)  will  not  be  deduced. 

To  make  things  even  more  complex,  now  suppose  in  addition  that 

a  —  d, 

b  -=  e. 

There  are  now  two  more  matching  variants  possible:  P(li(d)}  and  P(h( <:)). 
However,  if  the  demon  has  already  been  applied  to  tin'  P(h(<i))  and  P(/t (/»)). 
there  is  no  need  to  create  these  two  new  variants,  since  applying  the  demon 
to  them  would  result  in  the  assertions  P(h(d))  -+  Q{d)  and  P(h(f))  — >  Q(c). 
which  follow  by  substitution  from  the  assertions  above. 

The  reason  why  P(h(a)\  and  P(h{l>))  are  both  needed  is  that  they  are 
equal  by  substitution  at  intermediate  levels  in  the  pattern  P(h(lx)).  In 

5  A  concrete  instance  of  this  schema  is  the  axiom  Vx  log  x  >0  —  x  >  1 


I 


With  Changing  Equalities  11 

contrast,  P(h(a))  and  P(h(d))  are  equal  by  substitution  in  positions  occupied 
by  variables  in  the  pattern.  Thus  if  the  demon  body  does  not  depend  on 
the  internal  structure  of  the  variable  bindings  it  gets,  it  cannot  distinguish 
between  P(h(a))  and  P(h(d))  (this  notion  is  formalized  in  the  next  section). 

In  general,  we  can  define  the  following  equivalence  relation  on  the  set 
of  variants  that  match  a  given  pattern.  Note  that  this  definition  involves 
only  terms  and  patterns.  The  pattern  of  a  demon  is  the  only  declarative 
information  about  a  demon  that  will  be  available  to  the  pattern-directed 
invocation  algorithm. 

Definition.  Two  variants  matching  a  given  pattern  are  equiva¬ 
lent  with  respect  to  the  given  pattern  and  an  equality  relation  if 
and  only  if  they  differ  only  by  substitution  of  equals  at  positions 
occupied  by  variables  in  the  pattern. 

Thus  in  the  first  example  above,  /(0,  6)  and  /(0,1)  are  equivalent  variants 
with  respect  to  the  pattern  /( 0,?x)  and  the  given  equality  relation. 

In  the  case  of  flat  patterns,  all  matching  variants  are  equivalent.  This 
follows  because  two  terms  that  match  the  same  flat  pattern  can  differ  only 
at  positions  occupied  by  variables  in  the  pattern. 

In  the  second  example,  the  set  of  matching  variants  of  P(c)  is  partitioned 
into  equivalence  classes  as  follows: 

{ {P(M«0),  P(M<*))}.  <P(M‘)).  P(Me))}  }• 

To  guarantee  completeness  with  respect  to  this  demon,  we  need  to  apply  the 
demon  to  <"»ne  variant  from  each  of  these  classes. 

In  the  next  section,  the  notion  of  equivalent  variants  is  used  together  with 
the  notion  of  transparent  demons  to  prove  the  key  theorem  of  the  paper. 

3.3  Transparent  Demons 

The  two  example  demons  in  the  preceding  sections  both  have  the  property 
that  equivalent  variants  are  redundant.  It  is  possible,  however,  to  write 
demons  that  do  not  have  this  property.  For  example,  consider  the  following 
demon  as  an  alternate  implemention  of  the  axiom  introduced  in  Section  1.1. 

(  f(ly,  lx)  ,  \t .  if  ti  =  0  then  assert(/  >  0)  ). 


TO 
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(The  symbols  t0,  tj ,  and  t2  denote  the  operator,  first,  and  second  argu¬ 
ments  of  the  term  t,  respectively.)  The  problem  with  this  demon  is  that  it  is 
hiding  some  of  its  pattern  matching  inside  of  the  body,  where  it  is  inacces¬ 
sible  to  the  pattern-directed  invocation  algorithm.  For  example,  given  a  =  0 
and  6=1,  the  following  set  of  matching  variants  are  all  equivalent: 

{/M),/K  i),/(o,6),/(o,i)}. 

However,  if  the  demon  is  applied  only  to  f{a,b)  or  /(a,  1),  the  desired 
conclusion /(a,  6)  >0  will  not  be  deduced,  because  in  both  cases  the  condition 
in  the  body  will  be  false  and  the  demon  will  make  no  assertion. 

The  property  that  this  demon  lacks,  and  which  the  other  example  demons 
in  the  paper  thus  far  possess,  can  defined  formally  as  follows: 

Definition.  A  demon  is  transparent  if  and  only  if  it  results 
in  data  bases  with  the  same  equality  theory,  when  applied  to 
equivalent  variants  with  respect  to  its  pattern  and  a  given  data 
base. 

One  might  think  to  define  a  transparent  demon  more  simply  as  one  that 
results  in  the  same  data  base,  when  applied  to  equivalent  variants.  For 
example,  the  the  demon  above  is  not  transparent,  because  the  data  base 
resulting  when  it  is  applied  to  the  matching  term  /(0,6)  includes  the  propo¬ 
sition  /( 0,  b)  >  0,  whereas  the  data  base  resulting  when  it  is  applied  to  the 
equivalent  matching  variant  /(a,  6)  does  not. 

However,  consider  the  example  of  a  t  ransparent  demon  in  Section  3.2.  The 
data  base  resulting  when  it  is  applied  to  the  matching  term  /(0,6)  is  not  the 
same  as  the  data  base  resulting  when  it  is  applied  to  the  equivalent  matching 
variant  /( 0,1).  (One  data  base  includes  the  proposition  /(0,6)  >  0,  while 
the  other  includes  the  proposition  /(0, 1)  >  0,  and  not  vice  versa.)  However, 
the  equality  theory  of  the  two  data  bases  is  the  same  (given  a  =  0  and  6=1). 

Any  demon  written  in  the  rule  notation  introduced  in  Section  2.1  is  guar¬ 
anteed  to  be  transparent.  In  general,  the  behavior  of  the  body  of  a  t  rans¬ 
parent.  demon  must  not  be  conditional  on  the  values  of  its  bindings,  i.e.,  the 
subterms  of  the  input  term  that  correspond  to  variable  positions  in  the  pat¬ 
tern.  (BREAD  does  not  attempt  to  automatically  analyze  demon  bodies  to 
check  this  property.) 

The  ultimate  motivation  for  the  definition  of  transparency  above  is  the 
following  theorem,  which  establishes  an  implementable  constraint  on  pattern- 
directed  invocation  that  guarantees  completeness. 
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Theorem  1  (Completeness)  A  data  base  of  terms  is  complete  with  respect 
to  a  given  set  of  transparent  demons  if  each  demon  is  applied  to  (at  least) 
one  member  of  each  equivalence  class  of  matching  variants  of  every  term  in 
the  data  base. 

The  proof  of  this  theorem  is  as  follows.  Consider  applying  a  demon  d  to 
a  matching  variant  v  of  a  term  t  in  the  data  base.  The  antecedent  of  the 
theorem  states  that  d  has  already  been  applied  to  an  equivalent  matching 
variant  v'.  Since  d  is  transparent,  the  data  base  resulting  from  applying  d  to 
v  has  the  same  equality  theory  as  the  data  base  resulting  from  applying  d  to 
v'.  Therefore  the  data  base  is  complete. 

We  assume  in  the  rest  of  this  paper  that  demons  are  transparent,  unless 
stated  otherwise. 

3.4  Opaque  Demons 

An  opaque  (i.e.,  non-transparent)  demon  can  often  be  made  transparent  by 
removing  conditionals  from  the  body  and  changing  the  pattern.  For  example, 
consider  the  following  opaque  demon,  which  implements  the  idempotent  law, 
Vx  /(x,  x)  =  x,  for  the  operator  /: 

(  /(?x,  ?y)  ,  Xt .  if  ti  =  t2  then  assert(f  =  t\)  ). 

This  demon  can  be  rewritten  in  the  following  transparent  form: 

Rule  /(?x,?x)  =>  /(?x,?x)  =  ?x. 

It  might  occur  to  the  reader  to  try  to  make  the  opaque  demon  above 
transparent  by  replacing  t\  =  t2  by  t{  —  t2  in  the  body.  Although,  technically 
speaking,  the  demon  in  this  form  would  be  transparent,  it  would  not  be  a 
very  good  implementation  of  the  idempotency  law,  for  two  reasons.  First, 
the  law  would  only  be  instantiated  for  applications  of  /  that  are  added  to 
the  data  base  after  the  equality  between  fi  and  t2  is  established.  Second, 
if  the  equality  between  fi  and  t2  is  later  retracted,  the  proper  dependencies 
have  not  been  installed  to  cause  <  =  tj  to  be  retracted  also. 

If  there  is  more  than  one  conditional  in  the  body  of  a  demon,  it  may  be 
necessary  to  break  it  up  into  several  transparent  demons,  corresponding  to 
the  different  cases.  For  example,  the  following  opaque  demon 


/ 

t 
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(  /(?x,?y)  ,  Xt .  case 

t\  =  0:  case o 
<i  =  l:  casej  ) 

can  be  rewritten  as  the  following  two  transparent  demons: 

(  f(0,?y)  ,  Xt.caseo  ) 

(  /(l,?y)  ,  Xt .  casei  ) 

Not  all  opaque  demons  can  be  handled  in  this  way.  For  example,  consider 
the  following  opaque  demon:6 

{  /(?£,  ?y)  ,  Xt  .it  lt(<i,  <2)  then  assert(f  >  0)  ). 

There  is  no  way  to  make  this  demon  transparent  simply  by  giving  it  a 
different  pattern  or  expanding  it  into  a  (finite)  number  of  demons  with  the 
same  body.  To  guarantee  complete  use  of  the  knowledge  in  this  demon,  it 
would  have  to  rewritten  as 

Rule  /(?x,?y)  ?x  <  ?y  — ►  /(?x,?y)>  0. 

This  version  of  the  demon  has  the  disadvantage  that  it  will  instantiate  the 
right-hand  side  for  a II  applications  of  /.  However,  it  does  enable  proving 
that  /(a,  b)  >  0  when  only  the  relationship  between  a  and  b  is  known,  not 
their  specific  numerical  values. 

4  Development  of  the  Algorithm 

In  this  section  we  develop  a  complete  algorithm  for  pattern-directed  invoca¬ 
tion  in  two  stages.  The  first  stage  develops  a  complete  algorithm  for  the  case 
of  flat  patterns.  The  second  stage  reduces  the  case  of  nested  patterns  to  flat 
patterns  by  introducing  intermediate  demons. 

Within  the  development  of  the  flat  pattern  case,  there  are  three  steps. 
The  first  step,  in  Section  4.1,  introduces  a  simple  generalization  of  the  stan¬ 
dard  pattern-directed  invocation  algorithm,  and  shows  that  it  maintains  com¬ 
pleteness  when  new  demons  or  terms  are  added.  In  Section  4.2,  an  additional 

6Note  that  “It”  is  a  test  in  the  programming  language,  which  is  executed  when  the  body 
of  the  demon  is  executed.  This  should  not  be  confused  with  which  is  the  operator  of 
a  term  in  the  data  base. 
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procedure  is  introduced  to  maintain  completeness  when  new  equalities  are 
asserted.  Finally,  in  Section  4.3,  we  prove  that  no  action  is  required  when 
equalities  are  retracted,  due  to  the  use  of  the  closest  matching  variant. 

Section  4.4  introduces  the  procedure  the  flattens  nested  patterns  and 
proves  that  completeness  is  maintained  in  this  case.  Section  4.5  discusses 
termination  of  the  algorithm.  Section  4.6  describes  some  improvements  to 
the  algorithm  that  reduce  the  number  of  redundant  proof  paths. 


4.1  A  Simple  Generalization 

Figure  2  shows  a  simple  generalization  of  the  standard  algorithm  for 
pattern-directed  invocation.  The  most  basic  change  is  that  the  identity  (=) 
tests  in  lines  5  and  15  of  the  standard  matching  procedure  have  been  re¬ 
placed  by  equality  (=)  tests.  Thus,  when  match-w-equality  succeeds,  it  does 
not  necessarily  mean  that  the  given  term  matches  the  given  pattern,  but  only 
that  the  given  term  or  some  variant  matches  the  given  pattern.  The  bindings 
returned  by  the  matcher  indicate  a  matching  term. 

In  line  21,  the  try-w-equality  procedure,  which  is  called  whenever  a  demon 
or  new  term  is  added,  uses  the  bindings  returned  by  the  matcher  to  create  a 
matching  variant,  if  necessary.  (Substitute^,  B)  returns  the  pattern  resulting 
from  substituting  the  bindings  B  in  the  pattern  p .)  For  example,  when  /(a,  6) 
is  matched  against  the  pattern  /(0,?x),  with  a= 0,  match-w-equality  succeeds 
and  returns  the  bindings  {?x  <—  6}.  Try-w-equality  then  creates  the  matching 
variant  /( 0,  b).7 

Note  that  when  a  new  matching  variant  term  is  created,  the  add-term- 
w-equality  procedure  is  not  called.  This  feature  of  the  algorithm  is  crucial 
to  its  termination  (see  Section  4.5).  The  new  term  is  in  a  sort  of  “limbo” 
state,  in  which  it  exists  from  the  standpoint  of  the  equality  system,  but  not 
from  the  standpoint  of  pattern-directed  invocation.  This  term  will  be  added 
to  the  data  base  only  if  add-term-w-equality  is  explicitly  called  with  it  as  an 
argument. 

Note  also  (line  22)  that,  when  the  matching  variant  ( v )  is  already  in  the 
data  base,  try-w-equality  does  not  apply  the  given  demon,  unless  v  =  t.  This 
is  because  under  these  conditions,  the  given  demon  will  have  been  applied 

7One  might  also  think  of  simply  applying  the  demon  directly  to  the  given  term  when 
the  match  succeeds,  and  never  bothering  to  create  the  matching  variant.  This  approach 
is  discussed  in  Section  5.1. 
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function  match-w-equality(p,  t,  B) 

|  Match  pattern  p  to  term  t  with  bindings  B  with  equalities. 


p  is  a  variable: 

if  p  is  bound  to  s  in  B 

then  if  s  =  t 

then  return  B  [  same  value  ]] 
else  fail  f  different  value  ]j 

else  return  BU  {p  *—  t}  |  new  binding  J 
p  is  a  pattern  p0(pi,  •  -  • ,  p„): 
if  t  is  a  term  to(t\ , . .  • ,  tn) 

then  for  i  in  {0, . . . ,  n}  do 

B  <—  match- w-equality(pt,  B) 

return  B  f  all  positions  match 

else  fail 
p  is  a  constant: 

if  p  =  t  then  return  B 
else  fail 


is  function  try-w-equality(Cp,  d) 

d  Match  term  t  against  pattern  p  with  equalities 
and  apply  body  d  to  t  or  matching  variant ,  if  successful.  J 

19  B  <—  match-w-equality(p,  t,  0) 

20  if  match  succeeded 

21  then  v  i—  the-term(substitute(p,  B)) 

22  if  v  is  not  in  data  base  or  v  =  t 

23  then  apply  d  to  v  [and  B] 

Figure  2.  A  simple  generalization  of  the  standard  pattern-directed  invocation 
algorithm  in  Figure  1,  which  is  complete  for  flat  patterns  only.  (Continued  on 
next  page.) 
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24  function  add-term-w-equality(t) 

[  Add  new  term  t  to  data  base  with  equalities.  J 

25  for  each  demon  {  p  ,  d  ) 

26  try-w-equality(t,p,  d) 

27  add  t  to  data  base 

28  function  add-demon-w-equality(p,d) 

|  Add  demon  with  pattern  p  and  body  d  with  equalities.  J 

29  for  each  term  t  in  data  base 

30  try-w-equality(t,p,  d) 

31  add  (  p  ,  d  )  to  demons 


Figure  2.  (Continued) 

to  v  when  try-w-equality  was  called  on  v,  t,  and  d.  It  is  necessary  for  this 
call  to  have  occurred,  since  every  term  in  the  data  base  is  tried  with  every 
pattern.  Note  that  we  are  relying  on  the  fact  that  the  bindings  returned  by 
match-w-equality  will  correspond  to  its  argument  t  if  t  matches  the  pattern 
with  no  substitutions. 

In  certain  circumstances,  the  algorithm  in  Figure  2  causes  the  same  demon 
to  be  applied  more  than  once  to  the  same  term.  This  inefficiency  is  eliminated 
in  Section  4.6. 

The  algorithm  in  Figure  2  is  complete  for  flat  patterns  and  a  fixed  equal¬ 
ity  relation.  It  is  clear  by  the  definition  of  a  variant,  that  the  matcher  is 
guaranteed  to  find  a  matching  variant,  if  possible.  In  the  case  of  flat  pat¬ 
terns,  all  matching  variants  are  equivalent,  so  only  one  matching  variant  is 
required  to  satisfy  the  condition  of  the  Completeness  Theorem. 

In  the  next  two  sections,  we  extend  this  algorithm  to  the  case  of  changing 
equalities,  but  still  considering  only  flat  patterns.  Following  this,  we  treat 
the  case  of  nested  patterns. 

4.2  Changing  Equalities 


When  a  new  equality  is  asserted,  two  previously  separate  equality  classes 
are  joined.  This  in  turn  makes  new  variants  possible.  Any  term  with  a 
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subterm  in  either  of  the  two  equality  classes  must  be  re- matched  against  the 
patterns  of  all  demons  (see  Figure  3).  Matches  which  failed  before  may  now 
succeed. 

For  example,  suppose  there  is  a  demon  with  the  pattern  f(?x,  1).  When 
this  pattern  is  first  matched  against  the  term  /(a,  6),  with  a  =  0  and  no  other 
equalities,  the  match  fails.  If  the  equality  6=  1  is  asserted,  however,  the  term 
f(a,b )  needs  to  be  re-matched  (because  it  has  b  as  a  subterm).  This  time 
the  match  succeeds,  yielding  the  matching  variant  /(a,  1).  In  BREAD,  this 
re-matching  process  is  made  more  efficient  by  indexing  terms  and  patterns 
according  the  subterms  that  appear  in  them. 

When  an  equality  is  retracted,  some  of  the  variants  of  a  given  term  match¬ 
ing  a  given  pattern  may  cease  being  equal  to  the  given  term.  This  suggests 
that  re- matching  is  also  required  when  equalities  are  retracted. 

For  example,  consider  the  set  of  variants  of  f(a,  b )  matching  the  pattern 
/(0,  ?x),  with  a  =  0  and  6=1: 


{/(0,6),/(0,l)}. 

Since  all  of  these  equivalent,  from  the  standpoint  of  the  Completeness  The¬ 
orem,  it  doesn’t  matter  which  is  created  for  the  demon  to  be  applied  to. 

Suppose  that  the  variant  /( 0, 1)  is  chosen,  and  the  demon  from  Section  1.1 
is  applied  to  it.  The  demon  asserts 

/(0, 1)  >  0, 

from  which  the  system  can  deduce  by  equality  reasoning  that 

/(a. 6)  >  0. 
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1  function  join-equality-classes((7j,  C2) 

[  Join  equality  classes  C\  and  C2-  1 

2  for  each  term  t  in  data  base 

3  if  a  subterm  of  t  is  in  C t  UC2 

4  then  for  each  demon  (  p  ,  d  ) 

5  try-w-equality (t,  p,  d) 


Figure  3.  Procedure  to  be  executed  whenever  two  equality  classes  are  joined. 
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Now  suppose  that  b—  1  is  retracted,  but  a  —  0  remains.  The  system  is  now 
incomplete.  In  order  to  regain  completeness,  the  pattern  /(0,?a:)  needs  to  be 
re-matched  against  f(a,  b),  in  order  to  create  the  variant  /(0,fe)  and  apply 
the  demon  to  it.  Note,  however,  that  if  f(0,b)  were  chosen  in  the  first  place, 
there  would  be  no  need  for  re-matching. 

The  basic  idea  why  it  was  a  bad  idea  to  choose  the  variant  /( 0, 1)  above 
is  because  this  variant  uses  more  substitutions  than  necessary  to  match  the 
pattern  (e.g.,  substituting  1  for  b  was  unnecessary).  These  extra  substitutions 
make  the  deductive  chain  from  the  variant  to  the  original  term  susceptible 
to  be  being  broken  when  the  irrelevant  equalities  (e.g.,  b=l)  are  retracted. 
We  formalize  this  notion  in  the  next  section,  and  show  that  the  algorithm 
in  Figure  2  always  chooses  a  matching  variant  (e.g.,  /(0 ,6)),  such  that  com¬ 
pleteness  is  guaranteed  without  re-matching  on  retraction  of  equalities. 

4.3  Closest  Matching  Variants 

The  matching  variant  chosen  by  the  algorithm  in  Figure  2  is  as  “close”  as 
possible  to  the  given  term,  in  sense  of  depending  on  a  minimum  number  of 
equalities.  We  call  this  variant  a  closest  matching  term,  defined  as  follows. 

Definition.  For  a  given  term  and  flat  pattern,  a  closest  matching 
term  is  a  matching  term  in  which,  for  each  variable  in  the  pattern, 
the  corresponding  subterm  in  the  matching  term  is  identical  to 
(at  least)  one  of  the  corresponding  subterms  in  the  given  term. 

Note  that  the  definition  of  closest  matching  term  does  not  involve  the 
equality  relation.  Also,  there  is  not  always  a  unique  closest  matching  term. 
For  example,  both  f(a,a )  and  f(b,  b )  are  closest  matching  terms  for  the  term 
f{a,b)  and  the  pattern  /(?x,?i).  (Section  4.6  describes  an  improvement  to 
the  matching  algorithm  which  uniquely  chooses  a  closest  matching  variant.) 
For  a  term  and  pattern  that  match  without  any  substitutions,  the  closest 
matching  term  is  (uniquely)  the  term  itself. 

The  important  property  of  closest  matching  terms  is  given  by  the  follow¬ 
ing  theorem. 

Theorem  2  (Closest  Matching  Term)  For  a  given  term  and  flat  pat¬ 
tern,  if  a  closest  matching  term  is  not  a  variant,  then  there  are  no  matching 
va  riants. 
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The  proof  of  this  theorem  is  more  easily  understood  using  the  contraposi¬ 
tive  form:  If  there  is  any  matching  variant,  then  every  closest  matching  term 
is  a  variant. 

Let  v  be  a  variant  of  the  term  t  matching  the  flat  pattern  p.  Let  c  be  a 
closest  term  to  t  matching  p.  Let  ?x  be  a  variable  in  p.  Since  v  matches  p. 
it  must  have  the  identical  subterm  vx  in  each  position  corresponding  to  ?x. 
Since  c  matches  p,  it  must  have  the  identical  subterm,  cx,  in  each  position 
corresponding  to  ?x.  Furthermore,  since  c  is  a  closest  matching  term  to 
t,  cx  must  be  one  of  the  subterms  of  t  corresponding  to  ?x.  Since  v  is  a 
variant  of  t,  vx  must  be  equal  to  each  subterm  of  t  corresponding  to  ?x;  in 
particular,  vx  =  cx.  Since  v  and  c  both  match  p,  they  can  differ  only  in  the 
variable  positions  of  p.  We  have  shown  that,  for  an  arbitrary  variable  ?x, 
the  corresponding  subterms  vx  and  cx  are  equal:  therefore  c  is  a  variant  of  v. 
Since  v  is  a  variant  of  t,  c  is  also  a  variant  of  t. 

This  theorem  guarantees  that  no  re-matching  needs  to  be  done  when  an 
equality  is  retracted.  Patterns  and  terms  that  did  not  match  before  certainly 
will  not  match  after  the  retraction.  For  patterns  and  terms  that  did  match 
before  the  retraction,  the  body  of  the  demon  was  applied  to  a  closest  match¬ 
ing  term.  If  that  term  is  no  longer  a  variant  after  retraction,  this  theorem 
guarantees  that  there  are  no  matching  variants,  so  there  is  no  point  in  trying 
to  re-match. 

The  use  of  the  closest  matching  variant  can  lead  to  the  creation  of  more 
than  the  minimum  number  of  matching  variants  required  to  guarantee  com¬ 
pleteness.  It  is  essentially  a  trade-off  between  creating  extra  variants  when 
a  term  is  added  to  the  data  base  versus  extra  matching  when  an  equality  is 
retracted.  This  trade-off  is  discussed  further  in  Section  5.1. 

4.4  Flattening  Nested  Patterns 

The  match-w-equality  algorithm  yields  at  most  a  single  variant.  Since  we 
have  already  seen  examples  in  Section  3.2  involving  nested  patterns,  wherein 
more  than  one  variant  is  needed  to  guarantee  completeness,  this  algorithm 
is  clearly  incomplete  for  the  general  case  of  nested  patterns. 

One  might  consider  trying  to  directly  generalize  match-w-equality  further 
to  make  it  complete  for  nested  patterns.  This  would  entail  matching  against 
all  members  of  equality  classes  at  intermediate  levels  in  a  pattern.  For  ex¬ 
ample,  in  Section  3.2,  when  matching  the  pattern  P(h(‘!x))  against  the  term 
P{c),  one  would  match  the  subpattern  /i(?x)  against  all  terms  equal  to  c  (not 


21 


With  Changing  Equalities 


only  variants).  In  this  case  the  equality  class  of  c  contains  the  terms  h(a ) 
and  h(b). 

In  the  presence  of  changing  equalities,  however,  this  approach  is  not  sat¬ 
isfactory,  since  it  does  not  keep  track  of  intermediate  equalities  (equalities 
corresponding  to  intermediate  levels  in  patterns).  For  example,  suppose  that 
h(a)  —  h(b)  is  asserted  only  after  the  matching  of  P(h(?x))  against  P(c)  had 
been  performed.  The  variant  P(h(b))  will  not  be  created  unless  the  match¬ 
ing  is  re-done.  This  would  require  all  terms  and  patterns  to  be  re- matched 
whenever  any  equalities  are  added,  making  the  cost  of  incremental  assertion 
prohibitive. 

In  this  section,  we  show  how  to  reduce  the  case  of  nested  patterns  to  the 
case  of  flat  patterns  (for  which  we  have  a  complete  solution)  by  automat¬ 
ically  defining  intermediate  demons.  The  basic  idea  of  the  reduction  is  to 
incrementally  process  nested  patterns  “from  the  inside  out”.  When  an  at¬ 
tempt  is  made  to  add  a  demon  with  a  nested  pattern  to  the  system,  instead 
of  adding  it  directly,  we  look  inside  the  pattern  to  find  a  flat  subpattern  at 
some  level  of  nesting.  An  intermediate  demon  is  then  added  with  that  (flat) 
pattern  and  a  body  that  “remembers”  the  pattern  and  body  of  the  original 
demon.  The  rationale  for  this  process  is  that,  if  there  is  no  variant  of  a  term 
in  the  data  base  matching  the  flat  subpattern,  then  there  certainly  can’t  be 
a  variant  matching  the  whole  pattern. 

When  the  intermediate  demon  is  invoked,  it  uses  its  substitutions  to  make 
a  new  pattern  with  fewer  variables.  If  this  pattern  is  still  nested,  a  further 
intermediate  demon  will  be  added  for  a  flat  subpattern,  and  so  on.  This 
process  stops  when  there  are  no  nested  patterns  left,  at  which  point  the 
original  body  gets  executed. 

The  algorithm  which  performs  the  incremental  flattening  process  sketched 
above  is  shown  in  Figure  4.  We  will  first  explore  its  behavior  with  some 
examples,  and  then  give  a  proof  of  its  completeness  in  general.8 

We  illustrate  the  algorithm  with  the  example  from  Section  3.2.  The 
definition  of  the  demon 

Rule  P(h(?x))  =►  P(h(?x))  -»  Q(?x) 


results  in  the  call 

8Since  match-w-equality  will  now  only  be  called  on  flat  patterns,  the  recursion  in  thp 
definition  of  match-w-equality  can  be  eliminated  simply  by  “unrolling"  the  recursive  call 
one  level  and  then  removing  the  recursive  case 
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1  function  flatten(p,  d ) 

I  Add  demon  with  possibly  nested  pattern  p  and  body  d. 

2  if  p  is  nested 

3  then  choose  a  subpattern  p,  ot  p 

4  flatten(p,,  XtB  .  flatten(substitute(p,  B),  d)) 

5  else  add-demon-w-equality(p,  d)  [  flat  case  J 


Figure  4.  The  algorithm  for  adding  a  new  demon  with  a  pattern  that  may  include 
subpatterns.  Note  that  this  is  the  tail-recursive  definition  of  an  iterative  algorithm. 


flatten(P(/i(?i)),  body), 


which  finds  the  flat  subpattern  h(?x)  and  adds  the  intermediate  demon 


(  h(Ix)  ,  XtB .  flatten(substitute(P(/i.(?.r)),  B),  body)  ). 


(Note  that  we  are  assuming  here  that  both  the  matching  term  t  and  the 
corresponding  bindings  B  are  passed  to  the  body  of  a  demon.) 

Given  the  term  P(c)  and  the  equality  relation9 


c  =  h(a), 

c  =  h(b), 


the  pattern  of  the  intermediate  demon  matches  against  the  term  h(a)  return¬ 
ing  the  binding  {'lx  <—  a}.  The  body  of  the  demon  substitutes  this  binding 
in  the  pattern  P(h(1x))  and  calls 


flatten(/''(/i(o)),  body). 


This  call  to  flatten  goes  directly  to  the  flat  case,  adding  the  demon 


(  P(h(a))  -  body  ), 


9 It  does  not  matter  whether  this  term  and  equalities  were  present  at  the  time  the 
demon  was  added  to  the  system,  or  whet  her  they  were  added  afterwards,  since  the  system 


has  already  been  shown  to  be  complete  for  demons  with  fiat  patterns,  regardless  of  the 
order  of  events. 
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which,  when  matched  against  the  term  P(c),  causes  the  original  body  to  be 
applied  to  the  variant  P(h(a)),  asserting 

P(h(a))  ->  Q(a). 

An  analogous  process  starting  with  h(b)  causes  the  body  to  be  applied  to 
the  variant  P(h(b)).  As  discussed  in  Section  3.2,  these  are  the  two  matching 
variants  of  P(c )  required  for  completeness. 

Note  that  even  if  the  equalities 

a  =  d 
b  =  e 

are  supported  (or  are  asserted  later),  no  additional  variants  are  created, 
because  h(d)  and  h{e )  are  equivalent  to  h(a)  and  h(b),  respectively,  with 
respect  to  the  pattern  h(lx). 

Additional  example  traces  of  the  flattening  of  nested  patterns  are  given  in 
Figures  5  and  6.  Figure  5  illustrates  the  flattening  of  a  complex  pattern  with 
multiple  variables  and  multiple  occurrences  of  variables.  Figure  6  illustrates 
that  the  final  intermediate  demon  does  not  always  have  a  degenerate  pattern 
with  no  variables — it  only  must  be  flat. 

The  completeness  of  the  flattening  algorithm  is  established  by  the  follow¬ 
ing  two  theorems. 

Theorem  3  For  a  given  term  and  demon,  the  flattening  algorithm  applies 
the  body  of  the  demon  to  at  least  one  term  from  each  equivalence  class  of 
matching  terms. 

Theorem  4  For  a  given  term  and  (possibly  nested)  pattern,  if  all  of  the 
variants  created  by  the  flattening  algorithm  in  any  equivalence  class  stop  being 
variants,  then  there  are  no  matching  variants  in  that  class. 

The  proofs  of  both  theorems  are  based  on  the  following  observation.  Let  v 
be  some  variant  of  t  which  matches  p.  Let  lx  be  some  variable  in  p ,  and  let  vx 
be  the  corresponding  subterm  of  v.  Let  p'  be  the  first  flat  pattern  containing 
lx  which  is  added  by  flatten,  and  v'  be  the  subterm  of  v  corresponding  to  p' . 
If  v'  is  present  in  the  data  base,  it  will  match  p'  and  vx  will  be  substituted  for 
lx  in  further  patterns  generated  in  the  process  of  obtaining  a  variant.  If  v' 
is  not  present  in  the  data  base,  it  must  be  a  variant  of  some  term  s  which  is 
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Rule  5(/(?x,?y),/(?x,?2))=>P(/(?x,?y),/(?x,?2))  =  /(?x,fi,(?y,?z)) 
Initial  data  base: 


9(r,s) 

r  -  /(a,  b) 

s  =  f(d, c ) 
a  =  d 


•  flatten (y(/(?x,  ?y),  /(?x,  ?2)),  iody) 
t>  adds  (  /(?x,?y)  ,  body'  } 

where  body'  is  AfZ?  .  flatten(substitute(y(/(?x,  ?y ), /(?x,  ?2)),  B),  body ) 


f(a,b)  matches  /(?x,  ?y)  with  bindings  {?x  <—  a,  ?y  <—  6} 
t>  body 1  applied  to  f{a,b) 

o  flatten(y(/(a,6),/(a,?2)),k>dy) 
t>  adds  (  /(a,  ?z)  ,  body"  ) 

where  6odyr/  is  Af£? .  flatten(substitute(y(/(a,  b),  f(a,  ?z)),  B),  body ) 


f{d,c)  matches  f(a,7z)  with  bindings  {7z 
o  body"  applied  to  /(a,  c) 

o  flatten(y(/(a,  A),  /(a,c)),  6ody) 

t>  adds  (  g(f(ci,b),f(a,c))  ,  body  ) 


c}  yielding  variant  /(a,c) 


y(r,s)  matches  g(f(a,b),f(a,c))  yielding  g(f(a,b),f(a,c)) 
t>  body  applied  to  g(f[a,b),f(n,c)) 

>  asserts  y(/(a,  6),  /(a,  c))  =  /(a,  y(6,  c)) 


Figure  5.  A  trace  of  the  process  by  which  a  demon  implementing  the  distributive 
law  for  /  over  g  is  invoked  in  an  example  situation. 
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Initial  data  base: 


(  ,  body  ) 


f(c,d) 


c  =  h(a) 


•  flatten(/(/i(?:r),  ?y),  iody) 

>  adds  {  h{lx)  ,  body'  ) 

where  body'  is  XtB  .  flatten(substitute(/(h(?x),  ?y),  B),  body) 

•  h(a)  matches  h(?x)  with  bindings  {?x  a} 
c>  body'  applied  to  h(a ) 

i>  flatten (f(h(a),ly),  body) 

c>  adds  (  f(h(a),ly)  ,  body  ) 


•  f{c,d)  matches  /(/i(a),?y)  with  bindings  {?y  <—  d}  yielding  f(h(a),d) 
t>  body  applied  to  f(h(a),d) 

Figure  6.  An  example  trace  of  the  execution  of  a  demon  with  a  nested  pattern, 
in  which  the  final  intermediate  demon  does  not  have  a  degenerate  pattern. 
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in  the  data  base,  since  otherwise  v  could  not  be  a  variant  of  t.  When  match- 
w-equality  is  called  on  p'  and  s  it  must  succeed,  since  there  is  a  matching 
variant— v*.  The  bindings  produced  will  have  a  subterm  sx  of  s  for  ?x;  since 
p'  is  flat,  it  must  be  the  case  that  sT  —  vx  holds  (this  part  is  similar  to  the 
proof  of  the  Closest  Matching  Term  Theorem  for  the  flat  case).  In  both 
cases,  the  process  must  finally  yield  a  variant  of  i,  since  one  such  a  variant 
is  known  to  exist. 

We  have  shown  that,  for  an  arbitrary  variable,  a  variant  will  be  produced 
which  has  either  the  subterm  of  v  corresponding  to  that  variable  or  another 
term  which  is  equal  to  it  Therefore  that  variant  differs  from  v  only  in  the 
positions  occupied  by  variables  in  p,  and  thus  is  equivalent  to  v.  Since  v  was 
an  arbitrary  variant,  this  shows  that  the  algorithm  will  yield  at  least  one 
variant  from  each  equivalence  class. 

Now  consider  the  contrapositive  of  Theorem  4.  Suppose  after  some 
changes  to  the  equality  relation,  there  is  some  variant  v  of  t  which  matches 
p.  From  the  argument  above  it  follows  that  some  equivalent  variant  of  v  has 
been  added  by  the  cascading  algorithm.  This  variant  must  still  be  equal  to 
v  and  therefore  also  to  t. 

A  final  point  to  note  about  the  flattening  algorithm  is  that  it  can,  in 
the  worst  case,  lead  to  the  creation  of  an  exponential  number  of  variants. 
Sometimes,  all  of  these  terms  are  necessary  to  guarantee  completeness. 

In  the  case  of  flat  patterns,  each  match  between  a  pattern  and  a  term 
can,  in  the  worst  case,  lead  to  the  creation  of  a  single  new  variant.  Therefore, 
the  number  of  variants  created  by  the  algorithm  will  not  exceed  the  product 
of  the  number  of  patterns  and  the  number  of  terms  in  the  data  base. 

Consider,  however,  the  nested  pattern  p(h(?x i), .  . . ,  h(?xn))  and  the  term 
p(c, . . .  ,c),  with  the  equalities  c~  h(a)  and  c  =  h(b).  The  flattening  algorithm 
will  substitute  both  h(a )  and  h(b)  for  each  occurrence  of  c  in  the  term.  If 
h(a )  and  h(b)  are  not  variants,  then  the  exponential  number  of  terms  thus 
created  are  in  fact  all  necessary  for  completeness.  However,  if  h(a)  and  h(b) 
are  variants  (i.e.,  because  a  =  b),  all  the  terms  created  would  be  equivalent 
and  any  one  would  suffice.  The  algorithm  is  thus  much  less  t  han  optimal  in 
this  (relatively  pathological)  case. 

4.5  Termination 

Any  pattern -directed  invocation  system  (even  without  equality)  has  the 
problem  that  it  is  possible  to  write  demons  that  cause  the  system  to  go  into  an 
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Initial  data  base: 


Rule  h(h(?x))  =>  h(h(1x))  =  lx 


h{h(a)) 


•  Ratten(h(h(?x)),body) 

t>  adds  (  /i(?x)  ,  body'  ) 

where  body '  is  \tB  .  flatten(substitute(/i(/i(?x)),  B),  body) 

•  h(a)  matches  h(?x)  with  bindings  {?x  «—  a} 

>  body'  applied  to  h(a) 

t>  flatten (h(h(a)),  body) 

>  adds  (  h(h(a))  ,  body  ) 


•  h(h(a))  matches  h(h(a)) 

t>  body  applied  to  h(h(a)) 

>  asserts  h(h(a))  =  a 

•  h(h(a))  matches  h(?x)  with  bindings  {?x 
>  body'  applied  to  h(h(a)) 

>  flatten(/i(/i(/i(a))),  body) 

o  adds  (  h(h(h(a)))  ,  body  ) 


h{a)} 


•  h(a)  matches  h(h(h(a)))  [since  a  =  h(h(a))]  yielding  h(h(h(a))) 

>  body  applied  to  h(h(h(a))) 

>  asserts  h(h(h(a)))  =  h(a) 

Figure  7.  A  trace  of  the  process  by  which  a  demon  implementing  the  involutive 
law  for  h  is  invoked  in  an  example  situation.  Note  that  this  example  involves  the 
circular  equality  h(h(a))  =  a. 
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h(h(a))} 


•  h(h(h(a)))  matches  /i(?x)  with  bindings  {?x 

c>  6odt/  runs  on  h(h(h(a))) 

>  add-demon (h(h(h(h(a)))),  body) 

>  installs  (  h(h(h(h(a))))  ,  body  ) 


•  h(h(a))  matches  h(h(h(h(a))))  [since  h(a)  =  h(h(h(a)))\  yielding  h(h(h(h{a)))) 

t>  body  runs  on  h(h(h(h(a)))) 

t>  asserts  h(h(h(h(a))))  =  h(h(a)) 


Figure  8.  If  matching  variants  created  in  try-w-equalities  were  added  to  the  data 
base,  then  the  example  trace  in  Figure  7  would  continue  without  terminating,  as 
shown  above. 

infinite  loop.  For  example,  when  the  term  h(a)  is  added  to  the  data  base,  the 
following  apparently  straightforward  demon  implementing  the  involutive  law 
will  create  the  terms  h(h(a )),  h(h(h(a ))),  h(h(h(h(a) ))),  and  so  on  without 
end:10 

Rule  h(?x)  =$•  k(h(?x))  =  ?x. 

The  problem  with  this  demon  is  that  it  creates  a  new  term  in  its  body 
that  matches  its  own  pattern.  Sometimes  several  demons  can  conspire  to 
the  get  same  effect,  making  the  problem  much  less  obvious.  It  is  the  user’s 
responsibility  to  refrain  from  writing  these  kinds  of  demons. 

Non-terminating  demons  can  often  be  rewritten  so  that  they  terminate. 
For  example,  Figure  7  shows  the  version  of  the  involutive  law  used  in  FRAPPR. 
in  which  a  larger  pattern  is  used  to  avoid  the  termination  problem. 

The  example  in  f  igure  7  also  involves  circular  equalities  and  illustrates 
why  it  is  important  that  matching  variants  (created  in  line  21  of  Figure  2) 

10This  kind  of  non-terminating  behavior  is  often  called  “counting”,  because  it  reminds 
us  of  the  Peano  axiomitization  of  the  natural  numbers. 
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are  not  added  to  the  data  base.  As  can  be  seen  in  Figure  8,  if  the  matching 
variant  h(h{h(a)))  created  at  the  end  of  Figure  7,  were  added  to  the  data  base, 
it  would  successfully  match  against  the  pattern  h(?x),  eventually  causing  the 
matching  variant  h(h(h(h(a))))  to  be  created,  and  so  on  ad  infinitum. 

Despite  the  fact  that  it  is  always  possible  to  write  non-terminating  demons, 
we  do  want  to  make  sure  that  our  extensions  to  the  standard  algorithm  do 
not  introduce  any  additional  non-terminating  behavior.  One  way  of  saying 
this  more  precisely  is  to  prove  that  if  all  demons  defined  by  the  user  have 
empty  bodies,  then  the  system  will  always  terminate. 

For  flat  patterns,  termination  is  obvious.  There  are  a  finite  number  of 
demons  and  a  finite  number  of  terms  in  the  data  base.  The  algorithm  can,  at 
worst,  create  one  matching  variant  for  each  pair  of  term  and  demon.  Since 
the  bodies  of  the  demons  are  empty,  and  add-term- w-equality  is  not  called 
on  the  matching  variant,  the  system  stops  when  all  of  these  variants  have 
been  created. 

For  nested  patterns,  we  have  to  worry  about  the  intermediate  demons 
created  by  the  flattening  process.  All  of  these  demons  have  flat  patterns, 
but  their  bodies  are  not  empty.  Non-terminating  behavior  could  result  if  the 
execution  of  the  bodies  created  new  demons  ad  infinitum.  We  can  see  this 
will  not  be  the  case,  however,  by  observing  that  if  there  are  variables  in  the 
pattern  p  of  an  intermediate  demon,  then  the  body  of  the  demon  is  of  the 
form 

XtB .  flatten(substitute(p,  B ),  body), 

where  body  is  the  original  empty  body.  When  a  demon  of  this  form  is  ex¬ 
ecuted,  the  substitution  of  bindings  B  in  the  pattern  p  must  reduce  the 
number  of  variables  in  the  pattern.  Eventually,  a  sequence  of  such  calls  must 
eliminate  all  of  the  variables  in  any  pattern.  When  that  happens,  the  call 
to  flatten  will  go  directly  to  the  flat  case  and  install  a  demon  whose  pattern 
has  no  variables  and  whose  body  is  the  original  empty  body,  at  which  point 
the  process  terminates. 


4.6  Improvements  to  the  Algorithm 

This  section  describes  two  extensions  to  the  algorithm  in  Figure  2,  which 
improve  its  performance.  First,  under  certain  circumstances,  the  same  demon 
may  be  applied  more  than  once  to  the  same  term.  This  problem  is  remedied 
by  the  addition  of  some  indexing  information  on  each  term.  Second,  the 
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1  function  try-w-equality (t,p,  d) 

J  Match  term  t  against  pattern  p  with  equalities 
and  apply  body  d  to  t  or  matching  variant,  if  successful.  ]] 

2  B  *—  match- w-equality(p,  t,  0) 

3  if  match  succeeded 

4  then  v  <—  the-term(substitute(p,  B)) 

5  if  ( v  is  not  in  data  base  and  d  is  not  in  demon  list  of  v) 

6  or  (v  =  t  and  not  joining  equality  classes) 

7  then  apply  d  to  v  [and  B\ 

8  if  v  ^  t  then  add  d  to  demon  list  of  v 

Figure  9.  An  improved  version  of  try-w-equality  (see  Figure  2),  in  which  no 
demon  is  applied  more  than  once  to  the  same  term.  A  list  has  been  introduced  in 
lines  5  and  8  to  keep  track  of  the  demons  applied  to  a  given  term  while  it  is  not 
in  the  data  base. 

non-uniqueness  of  closest  matching  variants  causes  the  algorithm  to  create 
unnecessary  variants  under  certain  circumstances.  This  problem  is  alleviated 
by  the  addition  of  an  ordering  on  terms. 

The  circumstances  under  which  the  same  demon  (  p  ,  d  )  is  applied  to  the 
same  term  v  more  than  once  are  as  follows.  Suppose  that  v  is  the  matching 
variant  created  by  try-w-equality(f,p, d),  where  t^v.  Try-w-equality  applies 
d  to  v,  but  does  not  add  v  to  the  data  base.  It  is  possible  for  the  same  term 
v  to  be  created  by  a  different  call,  try-w-equality(f',  p,  d),  where  t^t'  ^v.u 
In  this  situation,  d  will  be  applied  again  to  v.  Alternatively,  suppose  that 
v  is  now  explicitly  added  to  the  data  base,  i.e.,  add-term-w-equality(u)  calls 
try-w-equality(u,  p,  d).  Again,  d  will  be  applied  to  v. 

The  problem  here  is  that  we  don’t  know  what  demons  have  been  applied 
to  a  term  while  it  is  in  the  “limbo”  state  (i.e.,  created  but  not  in  the  data 
base).  This  problem  is  straightforwardly  remedied  by  keeping  a  list  of  the 
demons  applied  to  a  given  term  while  it  is  in  this  state.  This  list  can  be 
cleared  and  the  memory  regained  in  add-term-w-equalitv,  when  the  term  is 
added  to  the  data  base. 

Another  case  where  the  same  demon  may  be  applied  to  the  same  term 
11  For  example,  let  p  be  h(?x),  v  be  h(a),  t  be  j(a),  and  t'  be  k(a),  with  h  =  j  =  k. 
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more  than  once  is  when  joining  two  equality  classes.  In  that  case,  all  demons 
have  been  applied  to  all  matching  terms  in  the  data  base,  and  the  reason  for 
re-matching  is  only  to  create  new  variants  if  appropriate.  In  that  case  there 
is  therefore  no  need  to  apply  the  demon  to  the  term  t  even  if  it  matches  the 
demon’s  pattern. 


The  improved  version  of  try-w-equality  with  these  additions  is  shown  in 
Figure  9.  Note  that  the  test  v£t  in  line  8  checks  for  when  try-w-equality 
is  called  from  add-term-w-equality  and  t  is  being  added  to  the  data  base,  in 
which  case  there  is  no  point  in  updating  the  demon  list  of  t. 


A  second  problem  with  the  algorithm  in  Figure  2  is  it  can  create  more 
matching  variants  than  necessary  for  patterns  with  multiple  occurences  of  a 
given  variable.  For  example,  consider  matching  the  term  /(a,  6)  against  the 
pattern  /(?x,?x)  with  a  —  b.  Both  f(a,a)  and  f(b,b)  are  closest  matching 
variants  of  /(a,  b)  with  respect  to  this  pattern.  Due  to  the  order  of  matching, 
the  algorithm  in  Figure  2  arbitrarily  chooses  to  create  f(a,a).  Now  suppose 
that  the  term  /(&,  a)  is  added  to  the  data  base.  This  term  has  the  same 
two  closest  matching  variants  with  respect  to  /(?x,?x),  but  in  this  case  the 
order  of  matching  in  the  algorithm  chooses  to  create  /(6, 6).  Since  /(a,  a) 
and  /(6,  6)  are  equivalent  with  respect  to  the  pattern  /(?x,  ?x),  and  both 
are  variants  of  both  /(a,  b)  and  /(&,  a),  only  one  is  required  to  guarantee 
completeness. 


This  problem  can  be  alleviated,  with  a  small  increase  in  the  complexity  of 
the  matching  algorithm,  by  introducing  an  arbitrary  (e.g.,  lexicographic  or 
timestamp)  order  on  terms.  When  the  matcher  has  a  choice  of  bindings  for 
a  variable,  it  always  chooses  the  smallest  term  in  the  order  (see  Figure  10). 
Thus  in  the  example  above,  if  lexicographic  order  is  used,  f(a,a)  will  be 
chosen  as  the  closest  matching  variant  for  both  f(a,b)  and  f(b,a).  Note  that 
as  long  as  closest  matching  variants  are  used,  it  will  sometimes  be  necessary 
to  create  more  than  one  variant  from  each  equivalence  class.  For  example,  if 
a,  b,  c  and  d  are  all  equal,  there  is  no  one  term  which  is  closest  to  both  /(a,  6) 
and  f(c,d)  with  respect  to  the  pattern  /(?x,?x).  See  however  Section  5.1, 
where  an  alternative  to  using  closest  matching  variants  is  discussed. 
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1  function  match- w-equality(p,  t,  B) 

[  Match  pattern  p  to  term  t  with  bindings  B  with  equalities. 
Complete  for  Bat  patterns  only.  ] 

2  case 

3  p  is  a  variable: 

4  if  p  is  bound  to  s  in  B 

5  then  if  s  =  t 

6  then  II  same  value  ] 

7  if  t  -<  s  then  replace  binding  of  p  in  B  by  t 

8  return  B 

9  else  fail  [  different  value  J 

10  else  return  BU{p<-t}  |  new  binding  J 


Figure  10.  An  improved  version  of  the  matching  algorithm  from  Figure  2.  Line  7 
has  been  added  to  guarantee  that  the  matcher  chooses  the  same  matching  variant 
for  variant  inputs,  “t  -<  s"  means  t  is  less  than  s  in  some  order. 
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5  Conclusion 

5.1  Alternate  Approaches 

The  main  goal  of  the  algorithm  developed  in  this  paper  has  been  to  guar¬ 
antee  the  completeness  of  pattern-directed  invocation  without  creating  the 
potentially  infinite  set  of  all  possible  variants.  As  part  of  the  development, 
however,  a  number  of  decisions  have  been  made  trading  off  between  time 
(matching  and  demon  execution)  and  space  (term  creation).  These  decisions 
have  been  based  on  the  relative  cost  of  various  computations  in  our  imple¬ 
mentation  environment,  and  our  expectations  of  use,  such  as  the  relative 
frequency  of  additions  to  the  data  base  versus  retractions.  In  this  section, 
we  briefly  sketch  two  alternate  decisions  that  could  reasonably  be  made. 

In  Section  4.1,  it  was  suggested  that,  when  match-w-equality  succeeds, 
rather  than  sometimes  creating  a  matching  variant,  one  might  always  apply 
the  given  demon  to  the  given  term.  For  example,  the  term  /(a,  b),  with  a  —  0, 
successfully  matches  against  the  demon 

(  f(0,lx)  ,  At.assert(<  >  0)  ), 

returning  the  bindings  {?£  <—  6}.  Rather  than  creating  the  new  term  /( 0,  ft), 
one  might  consider  applying  the  demon  body  to  f{a,b)  directly,  causing  it 
to  assert  f(a,b)  >  0,  which  is  the  desired  conclusion  in  this  case. 

The  problem  with  this  approach  is  that  the  proper  dependencies  are  not 
installed.  The  conclusion  f{a,b)  needs  to  be  retracted  when  the  equality  a— 0 
is  retracted.  In  order  to  make  this  approach  work  correctly,  we  would  need 
to  supply  the  body  of  each  demon  with  with  the  set  of  equalities  required  to 
make  the  match  succeed.12  Any  conclusions  made  by  the  body  would  then 
have  to  depend  on  these  equalities. 

The  attraction  of  this  approach  is  that  it  can  reduce  the  number  of  terms 
created.  On  the  other  hand,  it  can  increase  the  number  of  demons  applied. 
For  example,  suppose  that  in  the  situation  above  it  is  also  the  case  that 
c  =  0,  and  the  term  /(c,  6)  is  in  the  data  base.  The  algorithm  developed  in 
this  paper  will  create  the  term  /(0,  ft),  which  is  the  closest  matching  variant  to 
both  /(a,  ft)  and  /(c,  ft),  and  apply  the  demon  to  it,  asserting /(0,  ft)  >  0.  Both 
desired  conclusions,  namely  /(a,  ft)  >  0  and  /(c,  ft)  >  0  follow  by  substitution 

,2In  the  equality  system  of  BREAD,  it  is  very  cheap  to  test  if  two  terms  are  in  the  same 
equality  class.  It  is  much  more  expensive,  however,  to  compute  the  supporting  set  of 
equalities. 
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of  equals.  The  alternate  approach  suggested  here  will  apply  the  demon  twice, 
once  to  f(a,b)  and  once  to  f(c,b)  to  get  the  same  conclusions. 

Another  way  to  reduce  the  number  of  terms  created,  mentioned  in  Sec¬ 
tion  4.3,  is  to  give  up  using  the  closest  matching  variant.  For  example,  con¬ 
sider  matching  the  term  in  the  data  base  /(a, 6)  against  the  pattern  /( 0,?x) 
with  a  =  0.  The  closest  matching  variant  in  this  case  is  /(0,6) — if  it  doesn't 
already  exist,  the  algorithm  developed  above  will  create  it.  However,  suppose 
the  term  /(0,c)  already  exists  (in  the  data  base  or  in  “limbo”),  with  b  =  c. 
This  term  is  also  a  variant  of  f(a,  b )  matching  the  pattern.  Since  all  variants 
are  equivalent  in  the  flat  case,  completess  is  satisfied  as  long  as  the  demon  is 
applied  to  /(0,c).  The  term  / (0, 6)  does  not  need  to  be  created. 

The  problem  with  this  approach,  of  course,  is  that  when  equality  classes 
are  split  (e.g.,  when  b  =  c  is  retracted),  re-matching  needs  to  be  done,  similar 
to  when  equality  classes  are  merged.  At  that  time,  another  existing  match¬ 
ing  variant  may  be  found,  or  new  term  may  need  to  be  created.  Whether 
this  trade-off  is  advantageous  depends  on  the  relative  frequency  of  additions 
versus  retraction. 

Finally,  a  natural  question  to  ask  is  whether  the  algorithm  developed 
here  can  be  generalized  to  unification,  i.e.,  allowing  terms  in  the  data  to  con¬ 
tain  variables  (under  the  usual  convention  that  free  variables  are  universally 
quantified).  Although  the  matcher  could  be  straighforwardly  generalized  to 
do  unification,  it  turns  out  not  to  be  of  much  use. 

To  illustrate,  suppose  that  the  data  base  contains  the  term  /(?»,?<’)■ 
This  term  would  unify  with  the  pattern  of  the  following  demon: 


Rule  f(?x/!x) 


f(?x,?x)  =  ?  x . 


However,  creating  the  unifying  term  /(?u,?u)  and  applying  the  demon  to 
it  would  cause  /(?u,?u)  =  ?u  to  be  asserted,  which  doesn’t  add  any  new 
information  about  the  original  term  /(?u,?u).  The  conclusion  to  be  made 
here  is  that,  in  this  setting,  logical  variables  and  pattern  variables  are  different 
entities. 

While  on  the  topic  of  unification,  it  is  also  worth  pointing  out  related 
research  on  so-called  “generalized  unification”  [9].  The  problem  studied  in 
this  work  is  unification/matching  in  the  presence  of  implicit  equalities  due  to 
some  fixed  theory,  such  as  commutativity  or  associativity.  In  contrast,  our 
work  is  concerned  with  matching  in  the  context  of  an  explicit  data  base  of 
arbitrary,  changing  equalities. 
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5.2  Summary 

Pattern-directed  invocation  in  the  presence  of  arbitrary  equalities  raises  the 
completeness  problem  addressed  in  this  paper.  If  the  equalities  are  fixed 
and  patterns  are  flat,  the  changes  that  need  to  be  made  to  the  standard 
pattern-directed  invocation  algorithm  are  relatively  straightforward.  Chang¬ 
ing  equalities  and  nested  patterns  introduce  most  of  the  complexity  of  the 
algorithm  developed  here. 

The  algorithm  described  here  has  been  implemented  in  a  working  sys¬ 
tem  called  BREAD.  This  system  has  proved  to  be  a  useful  foundation  upon 
which  we  have  built  a  frame  system  [2]  and  a  system  for  reasoning  about 
programs  [7], 
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